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Abstract 


Escalation is the fact that in a game (for instance in an auction), the agents 
play forever. The 0,1-game is an extremely simple infinite game with intel¬ 
ligent agents in which escalation arises. At the light of research on cognitive 
psychology, it shows the difference between intelligence (algorithmic mind) 
and rationality (algorithmic and reflective mind) in decision processes. It also 
shows that depending on the point of view (inside or outside) the perception 
of the rationality of the agent change, which we call the principle of relativity. 

Keywords: economic game, inhnite game, sequential game, extensive game, 
escalation, speculative bubble, coinduction, auction. 


To Bernard Maris 

23 September 1946 — 7 January 2015 


That “rational agents” should not engage in such [es¬ 
calation] behavior seems obvious. 

Wolfgang Leininger m 

I can calculate the movement of the stars, 
but not the madness of men. 

NewtorQ in (1720) 

Sequential games are the natural framework for decision processes. In this pa¬ 
per we study a decision phenomenon called escalation. Infinite sequential games 
presented here generalize naturally sequential games with perfect information and 
have been introduced by Lescanne |18j and Lescanne and Perrinel [22] and formalize 
what is proposed in the literature m- Sequential games are games in which each 
player plays one after the other (or possibly after herself). Here we prove, using 
coinduction on a simple example, namely the 0,1 game, that escalation is not irra¬ 
tional. More precisely, in escalation, agents exhibit only the low part of the rational 
mind, namely the algorithmic mind, assimilated to intelligence or fluid intelligence 
to be more specific (see Stanovich [5S]). Since intelligent behavior contrasts with 
the somewhat obvious observation that escalation is irrational, we state a principle 
of relativity which says that “the view of the insider (the agent) is not the same 
as the view of the outsider (the observer)”, a well known fact in computer science 
when studying distributive system [13) . In addition, the 0,1 game has nice proper¬ 
ties which make it an excellent paradigm of escalation, a good domain of application 
of coalgebras and coinduction and a very nice opportunity to present coinduction, 
a tool designed to prove properties of complex systems. 

^Actually probably apocryphal Newton’s view on the outcome of the South Sea Bubble. 
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1 The problem of escalation 

Escalation in sequential games is a classic of game theory and it is admitted that es¬ 
calation is irrational. Consider agents able to reason formally that is making choices 
which are optimal and robust, in other words choosing equilibria. In finite sequential 
games, a right choice is obtained by a specific equilibrium called backward induction 
(see Appendix). More precisely a consequence of Aumann’s theorem [J says that 
an agent takes a good decision in a finite sequential game if she makes her choice 
according to backward induction. In this paper we generalize backward induction 
into subgame perfect equilibria and we explore the kind of reasoning obtained when 
built on subgame perfect equilibria (SPE in short). Such an appropriate reasoning 
is said to be based on an algorithmic mind (see Section [T]). 

What is escalation? In a sequential game, escalation is the possibility that 
agents take adequate decisions forever without stopping. This phenomenon has 
been evidenced by Shubik |34j in a game called the dollar auction. Without being 
very difficult, the analysis of the dollar auction is relatively involved, because it 
requires infinitely many infinite strategy profiles indexed by n S N |22j . At each 
step there are two and only two equilibria and therefore two potential intelligent 
decisions for the agents, namely “stop” or “continue”. If both agents choose always 
“continue”, an escalation occurs. In this paper, we propose an example which is 
simpler theoretically and which offers infinitely many infinite equilibria at each step 
unlike the dollar auction. Due to the form of the equilibria, the agent has no clue 
on which strategy is taken by her opponent. 

What is coinduction? Wikipedia gives the following definition of coinduction: 
“In computer science, coinduction is a technique for defining and proving properties 
of systems of concurrent interacting objects. ” Actually coinduction is more than a 
technique, since it is a formal approach to correct reasoning on infinite structures 
and to complex systems which applies far beyond computer science, especially to 
economics. First traces of coinduction can be found at the beginning of the XX*^ 
century when researchers tried to set a mathematical foundation to not well-founded 
sets [23] . It was revisited by Peter Aczel |2] creating the foundation of coinduction. 
Davide Sangiorgi |3T] gives a historical account of the field. Notice that Lawrence 
Moss and Ignacio Viglizzo have already apply coalgebras to the theory of normal 
form game [24] . 

Escalation and infinite games. Books and articles [9] (TT] [161 [25] cover 
escalation. Following Shubik, all agree that escalation takes place and can only 
take place in an infinite game, but their argument uses reasoning on finite games. 
Indeed, if we cut at a finite position the infinite game of the dollar auction in 
which escalation is supposed to take place, we get a finite game, in which the only 
right decision is to never start the game, because the only backward induction 
equilibrium corresponds to not start playing. Then the result is extrapolated by 
the authors to inhnite games by making the size of the game to grow indefinitely. 
However, it has been known for a long time at least since Weierstrafi [JD], that the 
“cut and extrapolate” method is wrong (see Appendix), or said otherwise, there 
is no continuity at infinite. For Weierstrafi this would lead to the conclusion that 
the infinite sum of differentiable functions would be differentiable whereas he has 
exhibited a famous counterexample. In the case of infinite structures like infinite 
games, the right reasoning is coinduction. With coinduction we were able to show 
that in the dollar auction escalation can be the result of a formal reasoning [^ |5D]. 
Currently, since the tools used generally in economics are pre-coinduction based. 
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they conclude that bubbles and crises are impossible and everybody’s experience 
has witnessed the opposite. Careful analysis done by quantitative economists, like 
for instance Bouchaud m n], have shown that bursts (aka volatility), which share 
much similarities with escalation, actually take place at any time scale. Escalation 
is therefore an intrinsic feature of economics. Consequently, coinduction is probably 
the tool that economists who call for a rethinking economics [HIEIEt] are waiting 
for [41]. 

Structure of the paper This paper is structured as follows. In Section [2] we 
present infinite games, infinite strategy profiles and infinite strategies, then we de¬ 
scribe the 0,1-game in Section [3| Last, we introduce the concept of equilibrium 
(Sections m and Ej) and we discuss escalation (Section |6|). In Section Owe discuss 
the actual rationality of the agents from a cognitive science point of view. In an 
appendix, we talk about finite 0,1-games and finite strategy profiles. Bart Ja¬ 
cobs m. Jan Rutten [3D] and Davide Sangiorgi [35] propose didactic introductions 
to coalgebras and coinduction. This paper is a deeply modified version of m- 


2 Two choice sequential games 

Our aim is not to present a general theory of coalgebras or a theoretical foundation 
of infinite extensive games. For this the reader is invited to look at [21101115]. But 
we want to give a taste of infinite sequential gamet0 through a very simple one. This 
game has two agents and two choices. To support our claim about the intelligence 
of escalation, we do not need more features. 

Assume that the set P of agents is made of two agents called A and B. In 
this framework, an ‘infinite sequential two choice game’ has two shapes. First, it 
can be an ending position in which case it boils down to the distribution of the 
payoffs to the agents. In other words, an ending game is reduced to a function 
/ : A I—/a, S I—/b and we write it (/). Second , it can be a generic game with a 
set Choice made of two potential choices: d or r (d for down and r for right). Since 
the game is potentially infinite, it may continue forever. Thus formally in this most 
general configuration a game can be seen as a triple: 


9 = 

where p is an agent and and gr are themselves games. The subgame gd is for 
the down choice, i.e., the choice corresponding to go down and the subgame g^ is 
for the right choice, i.e., the choice corresponding to go to the right. Therefore, we 
define a functor (see [3D] page 4 and following): 

():X -> -(- PxXxX. 

introducing a category of coalgebras of which Game is the final coalgebra and where 
P = {A, B}. In other words. Game satisfies the isomorphism 

Game ~ + P x Game x Game. 


Example 1 Here is a picture of a typical finite sequential game: 

^If the reader feels that this approach is not formal enough, she (he) can look at the “ultra”- 
formal approach found in the COQ scripts mentioned in Section [5] 
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If we write the full decomposition of ggi we get: 

ggi = {A I, B ^ 8), {A ^ A, B 7)), {A ^ 2, B 0)). 


Example 2 


■■>■ A 


0,1 1,0 


is a picture of an infinite game which will be studied more formally in Section 0 
Notice the dotted arrow ■■■■>• which shows the start of the game. 
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Example 3 The game which is solution of the equation 

9 = {A,{B,9,9),{B,9,9)) 

is a game infinite in both direction, down and right, alternating agents A and B. 
Notice that it has no leaf and that payoffs are not attributed. It can be pictured as: 


r 



Example 4 


® ®~® 

f )- )- y y y 

0,100 95,0 -5,95 90,-5 -10,90 85,-10 


r 



-5n,100-5n 100-5(n + l),-5n 


is the dollar auction game with bids of 5^. 


2.1 Strategy profiles 

From a game, we can deduce strategy profiles (later we will also say sometimes 
simply profiles) obtained by adding a label, at each node, which is a choice made 
by the agent. In a two choice sequential game, choices belong to the set {d,r}. 
Therefore a strategy profile which does not correspond to an ending game is a 
quadruple: 

S = {{p,C,Sd,Sr)), 

where p is an agent (A or B), c is a choice (d or r), and, Sd and Sr are two strategy 
profiles. The strategy profile which corresponds to an ending position has no choice, 
namely it is reduced to a function ((/)) = ((A >->• /a,B /b))- The functor 

(( )) : X ^ M'" + P X Choice x X x X. 

where 

P = {A,B} 

Choice = {r,d} 

introduces a category of coalgebras in which the coalgebra StratProf of infinite 
strategy profiles is the hnal coalgebra. Hence StratProf satishes the isomorphism: 

StratProf ~ + P x Choice x StratProf x StratProf. 


Example 5 Here are the pictures of three strategy profiles associated with the game 


5 





of Figure [IJ 
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The start of those strategy profiles is taken hy player A and she chooses right in 
the two first strategy profiles and down in the third strategy profile. 

Si is built with the strategy profiles: 


r 



1,8 
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Sl2 = 
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with 

Si = ((/4,r, sii,si 2 )). 

The full decomposition of sn is 

sii = ?') ((6j d, {{a 1, 6 8)), {{a 4, 6 7)))), {{A h> 2, 6 n- 0)))). 

Example 6 Figure [7] on page \1‘A Figure [H on page \1‘A and Figure 0 on page \14\ 
give strategy profiles of infinite sequential games. 

Example 7 

V J' h )' h ■ I' - 

0,100 95,0 -5,95 90,-5 -10,90 85,-10 -5n,100-5n 100-5(ti + 1) ,-5n 

is a strategy profile which is of interest in the dollar auction game \2A^ for proving 
that agent reasoning formally can enter escalation. 

Example 8 



is a strategy profile of the game of Example 0 and is solution of the equation 

s = ((A,d,{{B,r,s,s)),{{B,r,s,s)))) 

From a strategy profile, we can build a game by removing the choices: 

game :: StratProf Game 
game(((/))) = (/) 

game(((p,c, Sd,s,.))) = (p, game(sd), game(sr)) 

game(s) is the game of the strategy profile s. Notice that the function game is not 
recursive like say the function fib 

fib(O) = 0 
fib(l) = 1 

fib(n + 2) = fib(n + 1) + fib(n) 
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which defines the Fibonacci sequence. In fact, game is corecursive since it works on 
potentially infinite structures (see the end of this section) whereas fib which works 
on natural numbers works on finite structure and is recursive. Notice however that 
if we consider only finite games and finite strategy profiles, then a recursive game 
function could be defined. In this paper we consider the final coalgebras Game which 
contains finite and infinite sequential games and the final coalgebra Stratprof which 
contains finite and infinite strategy profiles. 

Given a strategy profile s, we can associate, by induction, a (partial) payoff 
function s', as follows: 

when s = ((/)) s = f 

when s = {{p,d,Sd,Sr)) 's = Sd 

when s = {{p,r, Sd, Sr)) s' = Sr 

In the literature on extensive games (HHg for instance), authors have a graph 
vision and they write fip(s) the payoff obtained by p starting at vertex v. Here 
we consider only the start vertex (let us call it start). For us, the vertices are 
not primitive objects. The other vertices are start vertices of subgames and of 
subprofiles and are only considered when dealing with those subgames and those 
subprofiles. What we write 's{p) would be written /ip*“''*(s) in [Sum- 

Example 9 In Example\^ we have: 

si (A) = 3 s'2{A) = 3 sl(H) = 2 

Si{B) = 2 51(5) = 6 ssiB) = 0 

s' is not defined if its definition runs in an infinite process. For instance, in Exam¬ 
ple [161 sllili is not defined and in Section |6l s^^ is not defined. To ensure that we 
consider only strategy profiles where the payoff function is defined we can impose 
strategy profiles to be convergenl^ written s (or prefixed j, (s)) and dehned as 
the least predicate satisfying 

s| < 7 > S = ((/)) V {s = {(p,d,Sd,Sr)) Asd i) \/ {s = {{p,r,Sd,Sr)) ASr i). 


Proposition 10 If s f, then s' is defined. 

Proof: By induction. If s = ((/)), then since 's = f and / is defined, s is 
defined. 

Assume s If s = ((p, c, Sd, Sr)), there are two cases: c = d or c = r. 

Let us look at c = d. If c = d, then Sd i and Sd is defined by induction 
and since s' = Sd, we conclude that s' is defined. 

The case c = r is similar. □ 

As we will consider the payoff function also for subprofiles, we want the payoff 
function to be defined on subprofiles as well. Therefore we define a property stronger 
than convergence which we call strong convergenc^ We say that a strategy profile 
s is strongly convergent and we write it s JJ- if it is the largest predicate fulfilling the 
following conditions. 

• ((P,c,sd,s^)) JJ. if 

— ((p, c, Sd, Sr)) is convergent. 


^Called leads to a leaf in m- 
'^Called always lead to a leaf in m- 



— Sd is strongly convergent, 

— Sr is strongly convergent. 

• ((/)) -D- that is that for whatever /, ((/)) is strongly convergent 
More formally: 

s 1| s = ((/)) V (s = {(p, c, Sd, Sr)) A s 4. ASd -II ASr -II). 

There is however a difference between the definitions of 4, and 4|. Wherever 
s I is defined by inductioT^ from ending games to the game s, s .|| is defined by 
coinductioT^ This difference between recursive and corecursive definition is the 
core of coalgebra theory [nisniisi]. However, this is not the aim of the present 
paper to present coinduction in detail. Both induction and coinduction are based on 
the fixed-point theorem. The definition of -l| is typical of infinite profiles and means 
that -1| is invariant along the inhnite profile. To make the difference clear and explicit 
between the definitions, we use the symbol <(!)> for inductive definitions and the 
symbol <©> for coinductive definitions. Recall that the definition of the function 
game :: StratProf —Game was presented as a coinductive function. We get easily 
the following proposition. 

Proposition 11 s -l| ^ s 4^ • 

Clearly we do not have the opposite implication, as shown by Example 1171 Indeed 
s 4^ is a local property whereas s -1| is a somewhat global property. 

We can define the notion of subprofile, written 

s' s <©> s' S V S = ((p, C, Sd, Sr)) A (s' Sd V s' Sr), 

where is the bisimilaritj0 among profiles defined as the largest binary predicate 
s's such that 

s'-s s s' = ((/)) = s V (s' = {{p, c, s'd, s'r)) A 

S = {{p,C,Sd,Sr)) A 
s'd Sd A s'r Sr)- 

Notice that since we work with infinite objects, we may have s 'fs s' and s A g' ^ g. 
In other words, an infinite profile can be a strict subprofile of itself. This is the 
case for si^o.a and si,o,& in Section |4l If a profile is strongly convergent, then its 
subprofiles are strongly convergent as well and the payoffs associated with all its 
subprofiles are defined. 

Proposition 12 

1. If Si -l| and S 2 ^ Si then S2 -1|. 

2. If Si -l| and if S 2 si, then S 2 is defined. 

^Roughly speaking a definition by induction works from the basic elements, to the constructed 
elements. For the natural numbers, for 0 and from n to n-|- 1. For finite strategy profiles, for ((/)) 
and from si and S 2 to ({p, c, si, 32 ))- 

®Roughly speaking a definition by coinduction works on infinite objects, like an invariant. 
^The reader can consider as the equality on StratProf. 
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2.2 The always modality 

We notice that 4- characterizes a profile by a property of the start vertex, we would 
say that this property is local. IJ. is obtained by distributing the property along the 
game. In other words we transform the predicate and such a predicate transformer 
is called a modality. Here we are interested by the modality always, also written □. 

Given a predicate $ on strategy profiles, the predicate □ d> is defined coinduc- 
tively as follows: 

□ d>(s) $(s)As = ((p,c,Sd,Sr-)) ^ (□$(sd)An$(sr)). 

The predicate “is strongly convergent” is the same as the predicate “is always 
convergent”. 

Proposition 13 s JJ- □ 4- (s). 

2.3 Strategies 

The coalgebra Strat of strategic.^ is defined by the functor 
11 : X R'" + (P + Choice) x X x X 

where P = {A, B} and Choice = {d,r}. In other words, the coalgebra Strat of 
strategies is solution of the equation: 

Strat ~ K.^ + (P + Choice) x Strat x Strat. 

A strategy of agent p is a game in which some occurrences of p are replaced by 
choices. A strategy is written |/| or |a:, si, S 2 l- 

Example 14 Consider the strategy for A in the game gg in which A decides to 
always take the choice r. 



By replacing the choice made by agent p by the agent p herself, the function 
st2g associates a game with a pair consisting of a strategy and an agent: 

st2g(|/l,p) = (/) 

st2g{lx,sti,st2l,p) = if a: € P then (x, st2g(sti,p), st2g(st2,p)) 

else (p, st2g(sti, p), st 2 g(st 2 , p)) ■ 

If a strategy st is really the strategy of agent p it should contain nowhere p and 
should contain a choice c instead. In this case we say that st is full for p and we 
write it st 'f. 

I/lt 

|x, sti, st2l t" (x ^ Choice ^ X 7 ^ p) A sti t'A st2 t’- 

strategy is not the same as a strategy profile, which is obtained as the sum of strategies. 


10 











We can sum strategies to make a profile. But for that we have to assume that all 
strategies are full and have the same game. We say that the strategies are consistent. 
In other words, (stp)pgp is a family of strategies such that: 

• Vp G P, stp f, 

• there exists a game g such that for all p in P, st2g returns g, more formally 
3g G Game,Vp G P,st2g(stp) = g. 

We define the sum 0 stp of consistent strategies as follows: 
peP 


01/1 = (if)) 

p£P 

|c, 0 0 Ip, stp, 1, stp, 2I = {{p',c, 0 1 7 0 Stp, 2 )). 

p€P\p' pGP pGP 

We can show that the game of all the strategies is the game of the strategy profile 
which is the sum of the strategies. 

Proposition 15 st2g(stp',p') = game(^^stp). 

pGP 


3 Infinipede games and the 0,1-game 

We will restrict to simple games which have the shape of combs. 





fi /2 h fi /s /e 


At each step the agents have only two choices, namely to stop or to continue. Let 
us call such a game, an infinipede. 

We introduce infinite games by means of equations. Let us see how this applies 
to define the 0,1-game. First consider two payoff functions: 

/o,i = A 0, B G- > 1 
/i,o = A 1, B 0 

we define two games 

90,1 = (A, (/o,i), Pi.o) 

9i.o = (B, (/i,o), ffo.i) 

This means that we define an infinite sequential game po,i in which agent A is the 
first player and which has two subgames: the trivial game (/i,o) and the game pi,o 
defined in the other equation. The game po,i can be pictured as follows: 
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or more simply in Figure [TJa. 

From now on, we assume that we consider only strategy profiles s whose game 
is the 0,1-game, that is game(s) = go,i- They are characterized by the following 
predicates 

So(s) <i> s = ((A,c, ((/o,i)),s')) 

Si(s) s = ((B,c, ((/i,o)),s')) ASo(s'). 


Example 16 Here is a strategy profile (Figure\^ 

Snr = {{A, r, ((/o,i)), ((6, r, ((/i,o)), Snr)))) 

where both agents continue forever. Notice that So(snr), “'(snr 4^) o- fortiori 
“'(■Snr -I)-)- Said in words, 

1. Snr has game go^i, 

2. Snr is not convergent, 

3. Snr is not strongly convergent. 

Example 17 Consider now the strategy profile (Figure\^ 

Sdnr = {{A,d, ((/o,i)), ((/i,o)),.Snr))))- 

This time So(sdnr): Sdnr i and -'{sddnr -11-)■ Said in words, 

F Sdnr has game gop^ 

2. Sdnr is convergent, 
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<?. Sdnr is not strongly convergent. 

We have SdU^(-4) = 0 and Sddi^(s) = 1. But is not strongly convergent 
since {{B,r, ((/i,o ))5 Snr)) is not convergent. 

Notice that the 0, l-game we consider is somewhat a zero-sum game, but we are 
not interested in this aspect. Moreover, a very specific instance of a 0,1 game has 
been considered (by Ummels [38] for instance), but these authors are not interested 
in the general structure of the game, but in a specific model on a finite graph, which 
is not general enough for our taste. Therefore for Ummels the 0,1-game is not a 
direct generalization of finite sequential games (replacing induction by coinduction) 
and not a framework to study escalation. 


4 Subgame perfect equilibria 

Among the strategy profiles, we can select specihc ones that are called subgame 
perfect equilibria [35]. Subgame perfect equilibria are specific strategy profiles that 
fulfill a predicate SPE. This predicate relies on another predicate PE which checks 
a local property. 

PE(s) O S IJ. A S = {{p,d, Sd, Sr)) ^ Sd{p) > Srip) 

A s = {{p, r, Sd, Sr)) => ^(p) > s'dip) 

A strategy profile is a subgame perfect equilibrium if the property PE holds always: 

SPE = nPE. 


Example 18 In Example\^we have SPE(si), SPE(s 2 ) and -iSPE(s 3 ). In Exam- 
ple[T3 -'SPE(snr) since -'(snr) U-- 

We may now wonder what the subgame perfect equilibria of the 0,1-game are. We 
present four of them in Figure [T]b, Figure [T]c and Figure |3l But there are others. 
To present them, let us define a predicate “A continues and B eventually stops” 

AcBes(s) s = ((p,c, ((/)), s')) ^ (p = A A / = /o,i A c = r A AcBes(s')) V 

(p = B A / = fifi A {c = dV AcBes(s')) 


Proposition 19 (Si(s) V So(s)) AcBes(s) ^ s = /i,o 

Proof: Assume Si(s) V So(s). If s = ((p, c, ((/)), s')), then So(s') V Si(s'). 
Therefore if AcBes(s'), by induction, s' = fi^. By cases: 

• lfp=AAc = r, then AcBes(s') and by definition of s', we have 
s = s' = fop 

• if p = B A c = d, the s = ((/i,o)) = /i,o- 

• if p = BAc = r, then AcBes(s') and by definition of s', s'= s' = /i.o- 

□ 

Like we generalize PE to SPE by applying the modality □, we generalize AcBes into 
SAcBes by stating: 

SAcBes = nAcBes. 
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There are at least two profiles which satisfies SAcBes namely si^o,a and Si^o.fe which 
have been studied in |20j and pictured in Figure [TJ 

SpO.a ((A,r, ((/o,l)),Sl,0,b)) si,o,b ((B, d, {(/l,o)), Sl,0,a)) 

So.l.a <©> ((A, d, ((/o,l)), So.pb)) So,!,!) <Q> ((B, r, ((/i,o)), SO,l,a)) 

In Figure [U we give other strategy profiles which fulfill the predicate SAcBes. For 
the first one we draw only the beginning of the strategy profile, but the reader can 
imagine that he continues a strategy profile in which A always continues whereas B 
does not always continue, in other words, B stops infinitely often. 



Proposition 20 SAcBes(s) => s IJ- . 

We may state the following proposition. 

Proposition 21 Vs, (So(s) V Si(s)) ^ (SAcBes(s) SPE(s)). 

Proof: Since SPE is a coinductively defined predicate, the proof is by 
coinduction. 

Given an s, we have to prove Vs, □AcBes(s) A (So(s) V Si(s)) => 
□ PE(s). 

For that we assume □AcBes(s) A (So(s) V Si(s)) and in addition 
(coinduction principle) □PE(s') for all strict subprofiles s' of s and we 
prove PE(s). In other words, s JJ- A{{p,d,Sd,Sr)) => Sd{p) > Srip) A 

{{p, r, Sd, Sr)) Sr{p) > s'dip). 

By Proposition [201 we have s JJ-. 

By Proposition [THl we know that for every subprofile s' of a profile s 
that satisfies Si(s)VSo(s) we have s' = /i^o except when s' = ((/o,i))- Let 
us prove {{p, d, Sd, Sr)) ^ s'dip) > 'V(p) A {{p, r, Sd, Sr)) ^ Srip) > Sdip). 
Let us proceed by case: 

• s = ((A,r, ((/o,i)), s')). Then So(s) and Si(s'). Since □AcBes(s), we 
have AcBes(s'), therefore s' = fi^ hence s'(a) = 1 and /o,i(a) = 0, 
henceforth s'(a) > /o,i(a). 
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• s = ((B,r, {{fi.o)),s')). Then Si(s) and So(s'). Since □AcBes(s), we 
have AcBes(s'), therefore s' = /i,o hence s'(b) = 0 and /i,o(b) = 0, 
henceforth s'(b) > /i_o(b). 

□ 

Symmetrically we can define a predicate BcAes for “B continues and A eventually 
stops” and a predicate S BcAes which is S BcAes = □ BcAes which means that B 
always continues and A stops infinitely often. With the same argument as for 
SAcBes we can conclude : 

Proposition 22 Vs, (So(s) V Si(s)) ^ SBcAes(s) => SPE(s). 

Lemma 23 Assume So(s) or Si(s), then SPE(s) ^ (SAcBes(s) V SBcAes(s)) 

Proof: By contradiction. Assume (-iSAcBes(s)) A (-'SBcAes(s)). This 
means that one of the following statements are fulfilled. 

• There exist sg and s^ such that s^ = ((A, d, ((/o,i)), s^)) sg = 

((6, d, {{/i^o))i Sg)) s and there are only “r’s” between B and 

A. Notice that Sg(B) = 1 while ((/i,o))(b) = 0. Since SPE(s) 
then SPE(sb) therefore sg = ((B, d, ({/i,o)), Sg)) implies Sg(B) < 
((/i.o))(b) which is a contradiction. 

• There exist s^ and sb such that sb = ((6, d, ((/i,o)), s^)) s^ = 

((A, d, ((/o,i)), Sg)) s and there are only “r’s” between A and B. 

The contradiction is obtained like above. 

• Snr ^ Si which means that eventually A and B continue forever 
and which is in contradiction with SPE(s) since -iSPE(sni.) (see 
Example fTSl). 

□ 

SAcBes V SBcAes fully characterizes SPE of 0,1-games, in other words. 

Theorem 24 Vs, (So(s) V Si(s)) (SAcBes(s) V SBcAes(s) <^=> SPE(s)). 

5 Nash equilibria 

Before talking about escalation, let us see the connection between subgame perfect 
equilibrium and Nash equilibrium in a sequential game. In |26j . the definition of 
a Nash equilibrium is as follows: A Nash equilibrium is a “pattern[s] of behavior 
with the property that if every player knows every other player’s behavior she has 
not reason to change her own behavior” in other words, “a Nash equilibrium [is] 
a strategy profile from which no player wishes to deviate, given the other player’s 
strategies. ” . The concept of deviation of agent p is expressed by a binary relation 
we call convertibilit]J^ and we write I-pH. It is defined inductively as follows: 



sI-pHs' 

SiPpHs^]^ S2pPHs2 
{{p,C,Si,S2)) I-pH ((p,c',Si,S2)) 
^This should be called perhaps feasibiliti/ following |29| and m 
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Si 1“ p H S21“ P H S2 


{{p',C,Sl,S2)) I-pH ((p ,C,si,S2)) 
We define the predicate Nash as follows: 


Nash(s) Vp, Vs', s I-pHs' ^ s{p) > s'{p'). 


The concept of Nash equilibrium is more general than that of subgame perfect 
equilibrium and we have the following result: 

Proposition 25 SPE(s) => Nash(s). 

The result has been proven in COQ and we refer to the script ('see|22)b 

http://perso.ens-lyon.fr/pierre.lescanne/COQ/EscRatAI/ 
http://perso.ens-lyon.fr/pierre.lescanne/COQ/EscRatAI/SCRIPTS/ 
Notice that we defined the convertibility inductively, but a coinductive definition 
is possible. But this would give a more restrictive definition of Nash equilibrium. 

6 Escalation 

Escalation in a game with a set P of agents occurs when there is a tuple of consistent 
strategies (stp)pgp such that its sum is not convergent, in other words, -i stp) 4 ,. 


peP 


Said differently, it is possible that the agents have all a private strategy which 
combined with those of the others makes a strategy profile which is not convergent, 
which means that the strategy profile goes to infinity when following the choices. 
Notice the two uses of a strategy profile: first, as a subgame perfect equilibrium, 
second as a combination of the strategies of the agents. 

Consider the strategy: 


S^A.oo = IcI/0,ll,s4,ool 

= IB, I/l,0l,siA.ool 


and its twin 


S^B.oo = [A, 


Moreover, consider the strategy profile: 


^A.cxd ((y*0,l)); '®B,c>d)) 

^B.CXD ((^ 1 , 0 )) ; '^A,C»)) * 


Proposition 26 





3. St2g(st4^^, A) = st2g(ste.oo, B) = 50,1 

4- game(s4_oo) = 50.1, 

5. 0 QQ ^A,C!o; 
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6 - “■ S4,oo i- 


Proof; The first statements are proved by coinduction on the definition 
of f, st 2 g, game and The last statement is by induction on the 

definition of □ 

Proposition [26l can be said in words as follows: 

1. sfA.oo is full for A, 

2. sfe.oo is full for B, 

3. sfA,oo and sfe.oo have game go,i, 

4. SA,oo has game 50 , 1 , 

5. strategy ste.oo plus strategy ste.oo yields profile sa.ooi 

6 . profile sa 00 is uot convergent. 

s^A.oo and sfB,oo are both correct since they are built using choices, namely r, 
dictated by subgame perfect eauilibrigT^ which start with r. Another feature of 
0 , 1 -game is that no agent has a clue for what strategy the other agent is using. 
Indeed after k steps, A does not know if B has used a strategy derived of equilibria 
in SAcBes or in SBcAes. In other words, A does not know if B will stop eventually or 
not and vice versa. The agents can draw no conclusion of what they observe. If each 
agent does not believe in the threat of the other she is naturally led to escalation. 

7 Relativity: are agents really rational? 

Observers are less eognitively busy and more open to 
information than aetors. 

Daniel Kahneman m 

Rationality as observed by an outsider is not the same as rationality seen by an 
insider [a [mu]. Since coalgebras are the mathematical tool for observation m 
a coalgebra approach does not come as a surprise. 

In this paper we would like to use the following definition: “An agent is rational^ 
if she is motivated by maximizing her own payoff”, but since this leads to debate, we 
prefer to say that an agent is intelligent in this case. In infinite extensive games, this 
translates in saying that an agent is intelligent if she adopts a strategy consistent 
with a subgame perfect equilibrium as an extension of backward induction. This 
explains the behavior of traders especially when they enter escalation. But is this 
compatible with common sense? More precisely whereas Howie Hubler (who lost 
8.67 X I0®$ for Morgan Stanley), Jerome Kerviel (who lost 6.95 x I0®$ for Societe 
G&CTale) or Brian Hunter (who lost 6.69 x I0®$ for Amaranth Advisors) were 
intelligent agents when they acted, they are seen clearly as stupid by an external 
observer. In other words, agents reason intelligently in an irrational escalation, 
which means that an agent can be rational in her closed world and seen irrational 
from outside. Is this consistent? Yes and this allows us drawing three conclusions: 

• Insider view differs from outsider observation. 

Recall that our concept of intelligent choice is that of a subgame perfect equilibrium, as 
it generalizes backward induction, which is accepted following Aumann [3] as the criterion of 
rationality for finite game. 

^^seen from inside 
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• According to K. E. Stanovich |351136] . there are two levels in Kahneman Sys¬ 
tem II (the effortful and slow part of the mind) |14j : algorithmic mind which 
is the ability of agents to reason perfectly and logically in a deductive sys- 
teirP^ (called mindware by Keith Stanovich) and reflective mind (or epistemic 
mind) which is the ability of an agent to reconsider her believes. For instance, 
reflective mind is visible when the agent realizes that the world (the game) is 
finite or when the agent realizes that maximizing her own profit is no more the 
main aim and that the survival of her company should be taken into account. 
In both cases, she changes her belief in an infinite world or in maximizing her 
own profit and adds or modifies one or more axiom(s) founding her deductive 
system, as part of her belief. 

• Consistently with the previous statement, an agent who is involved in an es¬ 
calation can be considered as having a short term vision, whereas the observer 
has a long term vision. The agent can also be considered as having a local 
vision, whereas the observer has a global vision. 

8 Mindware revisited 

We propose to extend the concept of mindware proposed by Stanovich which itself 
refines system II (slow thinking) of Kahneman [Tl]. For Stanovich, the deductive 
system of the mindware is always the same, namely classical first order logic, only 
its implementation in the mind of the agent may vary and can be incomplete. The 
reasoning of the mindware relies on believes that can be changed by the reflective 
mind. For us, the mindware may implement several kinds of deductive systems 
and may change from one agent to the other. For instance, it can implement 
classical first order logic, intuitionistic first order logic, classical or intuitionistic 
first order logic extended with inductive reasoning or extended with coinductive 
reasoning, higher order logic (intuitionistic or classical) etc. All those logics are 
equally acceptable, because they are consistent and from Gddel theorem we know 
that no universal system of deduction exists. Like for Stanovich, believes may also 
be changed by the reflective mind. In his book What Intelligence Tests Miss: The 
Psychology of Rational Thought [35) Stanovich presents a few examples to set his 
point. The fact that the mindware may implement several deductive systems leads 
to reconsider the scope of his examples. This is the case for the example that 
illustrates the beginning of chapter six: 

Jack is looking at Anne hut Anne is looking at George. Jack is mar¬ 
ried but George is not. Is a married person looking at a unmarried 
person? 

A) Yes B) No G) Gannot he determined 

Answer A, B, or G. 

[...] 

The vast majority of people answer G. 

In others words, the vast majority of people, me included, show on this example 
that they have a correct mindware based on a constructive logic, for instance, on 
intuitionistic logic, because such a logic is easier to use. Therefore they answer C. 
Stanovich claims that such an answer is incorrect and calls us cognitive misers. 
Actually when reading further [35] I understood that I was supposed to use a more 
sophisticated logic, especially that I should use the excluded middle, in other words, 
that I should use classical logic, because “most people can carry fully disjunctive 
reasoning when they are explicitly told that it is necessary” ( |35) p. 71). Therefore 

infinite game theory, the deductive system includes coinduction and SPE’s. 
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I changed my belief by adding p or not p and I answered A. But this requires more 
calculation. Indeed p or not p can be specialized into Anne is married or Anne is 
not married from which we draw Anne is married and George is not married and 
Anne is looking at George or Jack is married and Anne is not married and Jack is 
looking a Anne. We can abstract this into there exists x and there exists y such that 
X is married and y is not married and x is looking at y or there exists x and there 
exists y such that x is married and y is not married and x is looking at y which 
simplifies into there exists x and there exists y such that x is married and y is not 
married and x is looking at y and leads to answer A. However if the above question 
is completed by 

Choose one of the followings 

A) Anne is married 

B) Anne is not married 

C) I don’t know. 

A rational person including a person who answered A at the first question will 
answer C at this question. More precisely, answering A at the first question does 
not allow the agent to justify her answer by exhibiting a pair of persons such that 
one is married and looking at the other who is not married. Unable to justify their 
answer, the majority of people choose C at the Hrst question as well. 

To introduce disrationality, chapter two of [35] starts with the case of John Allen 
Panics who was involved in a classical escalation [35]. Actually John Allen Paulos, 
professor of mathematics at Temple University, is a typical person with a sound 
algorithmic mind using coinduction knowingly or no10. Actually he was disrational 
because he used improperly his reflective mind and did not change his belief in an 
“everlasting” WorldCom company (which eventually bankrupted in 2002) and in its 
eventual restart. Paulos is obviously “foolish”, if we say that someone is foolish, if 
he has a perfect algorithmic mind, but a faulty reflective mind, in other words if he 
is intelligent, but not rational. 

The above comments strengthen Stanovich’s distinction between algorithmic 
mind and reflective mind, but make the delineation of rationality harder and its 
evaluation difficult, because attributing a “rationality quotient” requires first to 
determine the deduction system implemented in the mindware and its strength, 
then to appreciate the ability of the agent of changing her believes. 

9 Which deductive system? 

I say that it is not illogical to think that the world is 
infinite. 

Jose Luis Borges, The library of Babel in |3] 

We noticed that several deductive systems can be used. We may wonder what 
features a deductive system should have. Let us tell some of them. First the lan¬ 
guage should contain a modality to enable agents to express belief {Bg) or knowl¬ 
edge (Kg). Moreover, the excluded middle (pV ->p) is clearly not mandatory. How¬ 
ever an agent a should be able to state that she believes in the excluded middle by 
a statement like 

Baii'^P ■ Proposition) pV -ip). 

Actually Paulos uses coinduction unknowingly, rather invoking a kind of invariant 
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This requires a quantification over propositions, which allows also expressing sen¬ 
tences like: We know there are known unknowns (see m Section 4): 

f\ Ka{{3p : Proposition) Ka{-^Ka{p))). 

a^Agent 

Besides quantifications over propositions, quantifications over functions and sets are 
required to express belief in finiteness ([50] Section 6): 

Baii^A : Set) (V/ : A ^ A) {Surjective{f) Injectve{f))) 

where Surjective is a predicate which asserts surjectivity and Injective is a predicate 
which asserts injectivity. 


10 Conclusion 

In this paper, we have shown how to use coinduction in economics, more precisely in 
economic game theory where it has not been used yet, or perhaps in a hidden form, 
which has to be unearthed. We have shown also that rational agents can be seen 
as irrational by observers since observation changes the point of view, in particular 
on rationality. When Wolfgang Leininger writes (see citation in front of Section |T]) 
that the fact that rational agents should not engage in an escalation seems obvious, 
he means, “obvious” for an observer, not for the agents, or perhaps he should have 
said that rational agents should not engage in an escalation, but that intelligent 
agents could. If agents are only intelligent, the efficiency of the markets should 
then be revisited at the light of escalation. Therefore coinduction is a possible way 
for rethinking economics. 
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A Finite 0,1 games and the “cut and extrapolate” 
method 

We spoke about the “cut and extrapolate” method, applied in particular to the 
dollar auction. Let us see how it would work on the 0,1-game. Finite games, finite 
strategy profiles and payoff functions of finite strategy profiles are the inductive 
equivalent of infinite games, infinite strategy profiles and infinite payoff functions 
which we presented. Notice that payoff functions of finite strategy profiles are 
always defined. Despite we do not speak of the same types □ of objects, we use the 
same notations, but this does not lead to confusion. Consider two infinite families 
of hnite games, that could be seen as approximations of the 0,1-game: 

Fo,l = (a, (/o,l), (B, (/i,o), ^04)) U {(/o,i)} 

Ko.i = (A,(/o.i),K'_i) 

Fq.i = (b, (/qo), ATo,!) u {(/i^o)} 

the sense of type theory 
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In Fq.i we cut after B and replace the tail by (/o,i)- In ^o.i we cut after A and 
replace the tail by (/i,o)- Recall [39] the predicate backward induction shortened in 
Bl, which is the finite and inductive version of PE. 

Bl((./)) 

Bl((p, c, SdjSr-)) = Bl(sz) A Bl(sr) A 

((p, d, Sd, Sr)) => s'dip) > frip) A 
((p, r, Sd, Sr)) ^ ^r{p)> ^d{p) 

Example 27 In Example\^we have Bl(si), Bl(s2) and -iBI(s3). 

We consider the two families of strategy profiles: 


SFo,i(s) (s = ((A,d,((/o,i)),((B,r,((/i,o)),s'))))ASFo,i(s')) V 

(s = ((A, r, ((/o,i)), {{B, r, ((/i.o)), s')))) A SFo.i(s')) V 
s = {{hi)) 

SKo,i(s) s=((A,r,((/o.i)),s'))ASK'^i(sO 

SKo i(s) <(T)> (s = ((B,d, ((/i,o)),s')) V s = {(B,r, ((/i,o)),s'))) A SKo,i(s') V 
s = ((/i.o)) 

In SFg 1 , B continues and A does whatever she likes and in SKg^, A continues and 
B does whatever she likes. The following proposition characterizes the backward 
induction equilibria for games in Tg^i and Kq i respectively and is easily proved by 
induction: 

Proposition 28 

• game(s) e Fg^ A SFg^i(s) Bl(s), 

• game(s) G Kop A SKg,i(s) O Bl(s). 

This shows that cutting at an even or an odd position does not give the same 
strategy profile by extrapolation. Consequently the “cut and extrapolate” method 
does not anticipate all the subgame perfect equilibria. Let us add that when cutting 
we decide which leaf to insert, namely (/o,i) or (/i,o), but we could do another way 
obtaining different results. 

0,1 game and limited payroll. To avoid escalation in the dollar auction, people 
require a limited payroll, i.e., a bound on the amount of money handled by the 
agents, but this is inconsistent with the fact that the game is infinite. Said otherwise, 
to avoid escalation, they forbid escalation. We can notice that, in the 0,1-game, a 
limited payroll would not prevent escalation, since the payoffs are anyway limited 
by 1. In the same vein, Demange m adds, to justify escalation, a new feature 
called joker which is not necessary as we have shown. 
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